SYSTEM AND METHOD FOR SELECTIVE
DECODING AND DECOMPRESSION

BACKGROUND OF THE INVENTION 

Technical Field: 

This invention is directed towards a system and process for selectively 
decoding and decompressing portions of the frames of a panoramic video. 

Background Art: 

Panoramic video is constructed of a sequence of frames, each of which 
depicts a 360 degree view of the surrounding scene or some significant portion 
thereof. These frames are played in a panoramic video viewer in sequence to 
produce a video of the scene. A panoramic video viewer may allow a user to 
navigate through the scene by processing user commands to pan right, left, up or 
down. In other words, a person viewing the panoramic video can electronically 
steer his or her viewpoint around in the scene as the video is playing. Such a 
panoramic viewer is the subject of a co-pending application entitled "A System and 
Process for Viewing Panoramic Video", which has the same inventors as this 
application and which is assigned to a common assignee. The co-pending
application was filed on __________ and assigned serial number
__________. The disclosure of this co-pending application is hereby
incorporated by reference.

As can be envisioned from the above discussion of viewing a frame of a
panoramic video, only a portion of the overall image is displayed to the user. Thus, 
much of the overall frame is not viewed at any one time. However, typically the 
entire panoramic video frame is input and processed by the viewer. 

In general, the transmission and storage of panoramic view frames present 
difficulties due to the amount of information they contain. In the case where these 
frames are transferred to the viewer over a network, such as the Internet, they will 
typically be compressed in some way. Unfortunately, even in a compressed form 
these frames represent a considerable amount of data, and so present problems in 
transmitting them to the viewer, as well as processing and storing them once 
received. Such large files are slow to transfer to a viewer. Additionally,
processing this image data in real time requires large amounts of Random Access
Memory (RAM) as well as powerful processors. Even in a case where the
frames are input to the viewer directly from a storage medium, such as a hard 
drive, CD, DVD, or the like, their size, especially if not compressed, imposes 
considerable storage and processing requirements on the viewer. 

SUMMARY 

The present invention overcomes the aforementioned limitations with a 
system and process that segments the panoramic video frames, thereby allowing 
selective decoding of just those specific regions that are to be viewed. Specifically, 
each frame is segmented into a plurality of regions. The frames are segmented in 
the same way such that the segmented regions correspond from one frame to the 
next. Each segmented region is then optionally compressed and encoded 
separately. Thus, separate video streams are generated for each of the 
segmented regions of each panoramic video frame. 

Once the panoramic video frames have been segmented, compressed (if 
desired), and encoded, they are ready for transfer to the viewer. This can be 
accomplished in a number of ways, each with particular advantages. One way to
transfer the frames involves an interactive approach. Essentially, the viewer, such
as the one described in the aforementioned co-pending application, identifies the
portions of the scene the user is currently viewing. In the case of a network
connection, the viewer then informs a server of the segments of the next frame of
the video that are needed to render the desired view of the scene to the user. The
server then transfers only the requested segments of the next panoramic video 
frame to the viewer. This process is repeated for each frame of the panoramic 
video.

The foregoing interactive embodiment has the advantage of conserving
bandwidth when sending data from the server to the viewer, since only the
data actually used by the viewer is transmitted. In addition, the processing and
storage requirements of the viewer are minimized, as only those portions of each
frame that are needed have to be decoded, decompressed and stored.

Of course, in some circumstances an interactive approach will not be desired 
or possible. In such cases, the system and process of the present invention still 
has advantages. Granted, all the segmented regions of each panoramic frame 
must be sent to the viewer as there is no feedback as to which regions are needed. 
However, once received, the viewer can selectively process and decompress (if
necessary) only those segments required to display the portion of the scene 
currently being viewed by the user. Thus, the processing and storage requirements 
of the viewer are minimized.

This panoramic video segmentation technique according to the present
invention also has some of the same advantages when employed with a direct
connection between the viewer and some type of storage media. Specifically, the 
segmented panoramic video frames are stored on a storage medium (e.g., hard 
drive, CD, DVD) to which the viewer has direct access. Thus, the viewer can
determine which segments of each panoramic video frame are needed to produce
the desired view to the user, and read only these segments from the storage
medium. In this way the processing and storage requirements of the viewer are
minimized. 

In regard to the encoding of the panoramic frame segments, each frame
segment is appended with an identifier that identifies what frame and what frame
segment "location" (i.e., what region of the panoramic frame) the accompanying
image data relates to. A separate file can be created for each video stream
corresponding to a certain frame segment region. Alternately, one file could be
created for all segment regions, with separate frames and frame segments being
identified by the aforementioned identifiers.

DESCRIPTION OF THE DRAWINGS

The specific features, aspects, and advantages of the present invention will 
become better understood with regard to the following description, appended 
claims and accompanying drawings where: 

FIG. 1 is a diagram depicting a general purpose computing device 
constituting an exemplary system for implementing the present invention.

FIG. 2 is a flow diagram of a method for segmenting and encoding a video
frame according to the present invention.

FIG. 3A is a diagram depicting a segmented panoramic video frame. 

FIG. 3B is a diagram depicting a possible file structure for use with the
present invention.

FIG. 4 is a flow diagram outlining transfer of video frames to a viewer in a
server-client environment.

FIG. 5 is a flow diagram outlining transfer of video frames to a viewer that is
not in a server-client environment.

FIG. 6 is a flow diagram outlining an example of the panoramic video 
segmentation technique in accordance with the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In the following description of the preferred embodiments of the present 
invention, reference is made to the accompanying drawings, which form a part
hereof, and in which is shown by way of illustration specific embodiments in which
the invention may be practiced. It is understood that other embodiments may be 
utilized and structural changes may be made without departing from the scope of 
the present invention. 

Exemplary Operating Environment:

FIG. 1 illustrates an example of a suitable computing system environment 
100 on which the invention may be implemented. The computing system 
environment 100 is only one example of a suitable computing environment and is 
not intended to suggest any limitation as to the scope of use or functionality of the 
invention. Neither should the computing environment 100 be interpreted as having 
any dependency or requirement relating to any one or combination of components 
illustrated in the exemplary operating environment 100. 

The invention is operational with numerous other general purpose or special 
purpose computing system environments or configurations. Examples of well 
known computing systems, environments, and/or configurations that may be 
suitable for use with the invention include, but are not limited to, personal 
computers, server computers, hand-held or laptop devices, multiprocessor
systems, microprocessor-based systems, set top boxes, programmable consumer
electronics, network PCs, minicomputers, mainframe computers, distributed
computing environments that include any of the above systems or devices, and the
like.

The invention may be described in the general context of
computer-executable instructions, such as program modules, being executed by a computer.
Generally, program modules include routines, programs, objects, components, data 
structures, etc. that perform particular tasks or implement particular abstract data 
types. The invention may also be practiced in distributed computing environments
where tasks are performed by remote processing devices that are linked through a
communications network. In a distributed computing environment, program
modules may be located in both local and remote computer storage media
including memory storage devices.

With reference to FIG. 1, an exemplary system for implementing the
invention includes a general purpose computing device in the form of a computer
110. Components of computer 110 may include, but are not limited to, a
processing unit 120, a system memory 130, and a system bus 121 that couples
various system components including the system memory to the processing unit
120. The system bus 121 may be any of several types of bus structures including a
memory bus or memory controller, a peripheral bus, and a local bus using any of a
variety of bus architectures. By way of example, and not limitation, such
architectures include Industry Standard Architecture (ISA) bus, Micro Channel
Architecture (MCA) bus, Enhanced ISA (EISA) bus, Video Electronics Standards
Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus
also known as Mezzanine bus.



Computer 110 typically includes a variety of computer readable media. 
Computer readable media can be any available media that can be accessed by
computer 110 and includes both volatile and nonvolatile media, removable and
non-removable media. By way of example, and not limitation, computer readable
media may comprise computer storage media and communication media.
Computer storage media includes both volatile and nonvolatile, removable and 
non-removable media implemented in any method or technology for storage of 
information such as computer readable instructions, data structures, program 
modules or other data. Computer storage media includes, but is not limited to, 
RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, 
digital versatile disks (DVD) or other optical disk storage, magnetic cassettes, 
magnetic tape, magnetic disk storage or other magnetic storage devices, or any 
other medium which can be used to store the desired information and which can
be accessed by computer 110. Communication media typically embodies computer
readable instructions, data structures, program modules or other data in a 
modulated data signal such as a carrier wave or other transport mechanism and 
includes any information delivery media. The term "modulated data signal" means 
a signal that has one or more of its characteristics set or changed in such a manner 
as to encode information in the signal. By way of example, and not limitation, 
communication media includes wired media such as a wired network or direct-wired 
connection, and wireless media such as acoustic, RF, infrared and other wireless 
media. Combinations of any of the above should also be included within the
scope of computer readable media. 

The system memory 130 includes computer storage media in the form of 
volatile and/or nonvolatile memory such as read only memory (ROM) 131 and 
random access memory (RAM) 132. A basic input/output system 133 (BIOS), 
containing the basic routines that help to transfer information between elements 
within computer 110, such as during start-up, is typically stored in ROM 131. RAM 
132 typically contains data and/or program modules that are immediately 
accessible to and/or presently being operated on by processing unit 120. By way of 
example, and not limitation, FIG. 1 illustrates operating system 134, application 
programs 135, other program modules 136, and program data 137. 

The computer 110 may also include other removable/non-removable, 
volatile/nonvolatile computer storage media. By way of example only, FIG. 1
illustrates a hard disk drive 141 that reads from or writes to non-removable,
nonvolatile magnetic media, a magnetic disk drive 151 that reads from or writes to 
a removable, nonvolatile magnetic disk 152, and an optical disk drive 155 that 
reads from or writes to a removable, nonvolatile optical disk 156 such as a CD 
ROM or other optical media. Other removable/non-removable, volatile/nonvolatile 
computer storage media that can be used in the exemplary operating environment 
include, but are not limited to, magnetic tape cassettes, flash memory cards, digital 
versatile disks, digital video tape, solid state RAM, solid state ROM, and the like. 
The hard disk drive 141 is typically connected to the system bus 121 through a
non-removable memory interface such as interface 140, and magnetic disk drive
151 and optical disk drive 155 are typically connected to the system bus 121 by a 
removable memory interface, such as interface 150. 

The drives and their associated computer storage media discussed above 
and illustrated in FIG. 1 provide storage of computer readable instructions, data
structures, program modules and other data for the computer 110. In FIG. 1, for
example, hard disk drive 141 is illustrated as storing operating system 144, 
application programs 145, other program modules 146, and program data 147. 
Note that these components can either be the same as or different from operating 
system 134, application programs 135, other program modules 136, and program 
data 137. Operating system 144, application programs 145, other program 
modules 146, and program data 147 are given different numbers here to illustrate 
that, at a minimum, they are different copies. A user may enter commands and 
information into the computer 110 through input devices such as a keyboard 162 
and pointing device 161, commonly referred to as a mouse, trackball or touch pad. 
Other input devices (not shown) may include a microphone, joystick, game pad, 
satellite dish, scanner, or the like. These and other input devices are often 
connected to the processing unit 120 through a user input interface 160 that is 
coupled to the system bus 121, but may be connected by other interface and bus 
structures, such as a parallel port, game port or a universal serial bus (USB). A 
monitor 191 or other type of display device is also connected to the system bus 121 
via an interface, such as a video interface 190. In addition to the monitor,
computers may also include other peripheral output devices such as speakers 197
and printer 196, which may be connected through an output peripheral interface
195. Of particular significance to the present invention, a camera 163 (such as a
digital/electronic still or video camera, or film/photographic scanner) capable of
capturing a sequence of images 164 can also be included as an input device to the
personal computer 110. Further, while just one camera is depicted, multiple
cameras could be included as input devices to the personal computer 110. The
images 164 from the one or more cameras are input into the computer 110 via an
appropriate camera interface 165. This interface 165 is connected to the system
bus 121, thereby allowing the images to be routed to and stored in the RAM 132, or
one of the other data storage devices associated with the computer 110. However,
it is noted that image data can be input into the computer 110 from any of the
aforementioned computer-readable media as well, without requiring the use of the
camera 163.

The computer 110 may operate in a networked environment using logical
connections to one or more remote computers, such as a remote computer 180.
The remote computer 180 may be a personal computer, a server, a router, a
network PC, a peer device or other common network node, and typically includes
many or all of the elements described above relative to the computer 110, although
only a memory storage device 181 has been illustrated in FIG. 1. The logical
connections depicted in FIG. 1 include a local area network (LAN) 171 and a wide
area network (WAN) 173, but may also include other networks. Such networking
environments are commonplace in offices, enterprise-wide computer networks,
intranets and the Internet.



When used in a LAN networking environment, the computer 110 is
connected to the LAN 171 through a network interface or adapter 170. When used
in a WAN networking environment, the computer 110 typically includes a modem
172 or other means for establishing communications over the WAN 173, such as
the Internet. The modem 172, which may be internal or external, may be
connected to the system bus 121 via the user input interface 160, or other
appropriate mechanism. In a networked environment, program modules depicted
relative to the computer 110, or portions thereof, may be stored in the remote
memory storage device. By way of example, and not limitation, FIG. 1 illustrates
remote application programs 185 as residing on memory device 181. It will be
appreciated that the network connections shown are exemplary and other means of
establishing a communications link between the computers may be used.



The exemplary operating environment having now been discussed, the
remaining parts of this description section will be devoted to a description of the
program modules embodying the invention.



Frame Segmenting, Encoding and Compression

As shown in FIG. 2, the system and method according to the present
invention involves segmenting the panoramic video frames, thereby allowing
selective decoding and possibly decompression of just those specific regions that
are to be viewed. Specifically, each frame is segmented into a plurality of regions
(process action 200). Each frame is segmented in the same way such that the
segmented regions correspond from one frame to the next. Each segmented
region is then encoded separately, as shown in process action 202. Further, each
segment of the panoramic video frames can be optionally compressed, as shown in
process action 204.



In regard to the segmenting of each frame of a panoramic video, it is noted
that any segmentation pattern can be employed as long as the same pattern is
used for each frame. For example, the panoramic video frame depicted in FIG. 3A
is divided into square regions. However, it is noted that the present invention is not
limited to the depicted segmentation pattern. Rather any pattern desired can be
employed.
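By way of example, and not limitation, the segmentation step might be sketched as follows. The sketch assumes a frame held as rows of pixel values and a rectangular grid pattern; the function name `segment_frame`, the 2 x 4 grid, and the toy frame contents are illustrative assumptions and are not part of the described system.

```python
from typing import Dict, List, Tuple

Frame = List[List[int]]  # rows of pixel values; a stand-in for real image data


def segment_frame(frame: Frame, rows: int, cols: int) -> Dict[Tuple[int, int], Frame]:
    """Split a frame into rows x cols rectangular regions keyed by (row, col)."""
    height, width = len(frame), len(frame[0])
    if height % rows or width % cols:
        raise ValueError("frame size must be divisible by the grid size")
    seg_h, seg_w = height // rows, width // cols
    regions = {}
    for r in range(rows):
        for c in range(cols):
            regions[(r, c)] = [
                line[c * seg_w:(c + 1) * seg_w]
                for line in frame[r * seg_h:(r + 1) * seg_h]
            ]
    return regions


# Two toy 4x8 frames segmented with the same 2x4 pattern, so that region
# (r, c) covers the same part of the scene in every frame.
frames = [[[f * 100 + y * 8 + x for x in range(8)] for y in range(4)] for f in range(2)]
segmented = [segment_frame(frame, rows=2, cols=4) for frame in frames]
```

Because the same grid is applied to every frame, the key (r, c) identifies corresponding regions from one frame to the next.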

In regard to the encoding of the panoramic frame segments, each frame
segment is appended with an identifier. This identifier at a minimum identifies what
frame and what frame segment "location" (i.e., what region of the panoramic frame)
the accompanying image data relates to. A separate file can be created for each
video stream corresponding to a certain frame segment region. Alternately, one file
could be created for all segment regions, with separate frames and frame
segments being identified by the aforementioned identifiers. FIG. 3A depicts a
segmented video frame 300, showing the various segmented regions of the frame,
300a through 300h. Referring to FIG. 3B, a diagram is provided that conceptually
illustrates a possible configuration of the above-described encoded file that
corresponds to the segmented video frame shown in FIG. 3A. The file 302 includes
a data structure formed from a plurality of elements or fields. For example, the file
includes a plurality of elements associated with the image or texture map data
representing each frame segment of the panoramic video, TD300a through TD300h.
Each segment should also have associated with it an index portion or indicator,
P300a through P300h, identifying the frame and the frame segment locations of the
segment represented.
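A single-file arrangement of this kind could, for instance, be sketched as shown below. The record layout is purely an assumption made for illustration: the identifier is placed ahead of the data as three unsigned 32-bit integers (frame number, segment location, data length), and the names `encode_segment` and `decode_segments` are hypothetical. The actual layout of file 302 is not specified beyond the index portion and data elements described above.

```python
import struct

# Hypothetical identifier layout: frame number, segment location and payload
# length, each an unsigned 32-bit big-endian integer.
HEADER = struct.Struct(">III")


def encode_segment(frame_no: int, location: int, data: bytes) -> bytes:
    """Attach an identifier naming the frame and segment location to the data."""
    return HEADER.pack(frame_no, location, len(data)) + data


def decode_segments(blob: bytes):
    """Yield (frame_no, location, data) for each record in a single file."""
    offset = 0
    while offset < len(blob):
        frame_no, location, size = HEADER.unpack_from(blob, offset)
        offset += HEADER.size
        yield frame_no, location, blob[offset:offset + size]
        offset += size


# One file holding eight segment regions for each of two frames
# (placeholder bytes stand in for the image or texture map data).
single_file = b"".join(
    encode_segment(f, loc, bytes([f, loc]) * 3) for f in range(2) for loc in range(8)
)
records = list(decode_segments(single_file))
```

The alternative of one file per segment region would simply write the records for each location to a separate file.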

In regard to compressing the panoramic frame segments, a common way of
compressing video for transfer over a network is by using the MPEG4 compression
method. In very general terms, MPEG compression works by comparing
successive frames in a series of frames looking for items that are similar in these
frames. Once the similar items have been identified, the MPEG compression
algorithm transmits the similar items only once and subsequently only sends the
differences (or non-similar items) identified. While it would not be possible to use
this compression method to first compress the frames of the panoramic video and
then attempt to segment them, it is possible to segment the frames first and then
separately compress each of the series of corresponding segmented regions from
successive frames. Thus, compressed video streams can be generated from each
of the corresponding segmented regions of the panoramic video frames. In the
context of the system and method according to the present invention, if the data
files are compressed, then the corresponding data structure will contain
compressed data as elements in place of the original uncompressed data.
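The segment-first, compress-second ordering can be illustrated with the following sketch. It deliberately substitutes the general-purpose zlib compressor for MPEG compression, since only the ordering of the steps is being shown; the function name `compress_region_streams` and the placeholder segment data are assumptions made for the example.

```python
import zlib
from typing import Dict, List


def compress_region_streams(segmented_frames: List[Dict[int, bytes]]) -> Dict[int, bytes]:
    """Build one compressed stream per region from corresponding segments.

    zlib is only a stand-in here; a real system would use a video codec.
    """
    streams = {}
    for location in segmented_frames[0]:
        compressor = zlib.compressobj()
        parts = [compressor.compress(frame[location]) for frame in segmented_frames]
        parts.append(compressor.flush())
        streams[location] = b"".join(parts)
    return streams


# Three frames, four regions, 16 bytes per segment (placeholder pixel data).
frames = [{loc: bytes([loc]) * 16 for loc in range(4)} for _ in range(3)]
streams = compress_region_streams(frames)

# Decompress region 2 only; the other three streams are never touched.
region_2 = zlib.decompress(streams[2])
```

Because each region has its own stream, a viewer can decompress the streams for the regions in view without processing the remainder of the frame.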

Transfer to the Viewer and Decoding




Once the panoramic video frames have been segmented, compressed (if 
desired), and encoded, they are ready for transfer to the viewer. This can be 
accomplished in a variety of ways, as will now be described.

Referring to FIG. 4, one way to transfer the files involves an interactive
approach. This interactive approach employs a network environment. The viewer
is connected to this network and data is transferred to the viewer via this means. A
frame of a panoramic video is input, as shown in process action 402. Essentially,
the viewer, such as the one described in the aforementioned co-pending
application, identifies what portions of the scene the user is currently viewing
(process action 404). The viewer then informs a server of the segments of the next
frame of the video that are needed to render the desired view of the scene to the
user as shown in process action 406. The server then causes only those frame
segments associated with the requested portions of the next panoramic video
frame to be transferred to the viewer (process action 408). As shown in process
action 410, this process is repeated for each frame of the panoramic video.
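One way the viewer might identify the needed segments (process action 404) is sketched below. The sketch assumes, purely for illustration, that the panorama is divided into equal-width vertical strips spanning 360 degrees with segment 0 starting at 0 degrees, and that the view is described by a pan angle and a horizontal field of view. The name `needed_segments` and these parameters are hypothetical.

```python
import math
from typing import List


def needed_segments(pan_deg: float, fov_deg: float, num_segments: int) -> List[int]:
    """Return the indices of the segments overlapped by the current view.

    Assumes equal-width vertical strips covering 360 degrees, with segment 0
    starting at 0 degrees; the view may wrap around the 0/360 degree seam.
    """
    if fov_deg >= 360:
        return list(range(num_segments))
    seg_width = 360.0 / num_segments
    first = math.floor((pan_deg - fov_deg / 2.0) / seg_width)
    last = math.floor((pan_deg + fov_deg / 2.0) / seg_width)
    last = min(last, first + num_segments - 1)  # never list a segment twice
    return [i % num_segments for i in range(first, last + 1)]


# A 60 degree wide view centred on 10 degrees straddles the seam.
request = {"frame": 1, "segments": needed_segments(10, 60, 8)}
```

The resulting list is what the viewer would send to the server in process action 406; the form of the request message itself is left open here.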



A standing order system can also be implemented. As the user changes
viewpoints within the scene being viewed, different segment regions
of subsequently transferred frames will be needed to prepare the desired view. In
a standing order system, the viewer requests the desired frame segments,
and the requested segments are sent for each consecutive frame until a new
request is sent by the viewer. Once a new request is received, the server sends the
newly requested set of frame segments until another order is received, and so on, until
the entire video has been viewed.
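The server side of such a standing order scheme might be sketched as follows. The function name `serve_standing_orders` and the representation of incoming requests as a mapping from frame number to requested segment locations are assumptions made for the example only.

```python
from typing import Dict, Iterator, List, Tuple


def serve_standing_orders(
    num_frames: int, orders: Dict[int, List[int]]
) -> Iterator[Tuple[int, List[int]]]:
    """Yield (frame_no, segments_sent) for each frame of the video.

    `orders` maps the frame number at which a new request arrives to the
    requested segment locations; the latest request stays in force until
    it is replaced by a newer one.
    """
    current: List[int] = []
    for frame_no in range(num_frames):
        if frame_no in orders:
            current = orders[frame_no]
        yield frame_no, list(current)


# The viewer orders segments 7 and 0 at the start and pans right at frame 3.
sent = list(serve_standing_orders(5, {0: [7, 0], 3: [0, 1]}))
```

Compared with a request for every frame, this reduces the traffic from the viewer to the server to one message per change of viewpoint.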

In order to assist the panoramic video viewer in identifying the desired
segmented regions of each panoramic video frame that should be transferred in the
case of an interactive network connection, an initialization file approach can be
employed, as it was in the aforementioned co-pending application entitled "A
System and Process for Viewing Panoramic Video". The viewer described in the
co-pending application needs certain information to play panoramic videos. The
initialization file is used to provide the needed information. Essentially, an 
initialization file associated with a panoramic video is sent by the server to the 
viewer prior to the viewer playing the video. In one preferred embodiment, this file 
includes, among other things, pointers or identifiers that indicate how each frame of 
the panoramic video can be obtained. Thus, in the case of the present invention, 
the initialization file would be modified to include identifiers that would indicate how 
to obtain each frame segment of every frame of the panoramic video. Further, the 
initialization file indicates the order in which the frame segments should be played. 
The identifiers would uniquely identify each frame segment of the panoramic video 
and the viewer uses these identifiers to request the desired frame segments. 
Specifically, the viewer determines which portion of the scene depicted in a frame
of the panoramic video the person viewing the video wishes to see. It then
requests only those segments of each panoramic video frame that are needed to 
provide the desired view to the user, in the frame order indicated in the initialization 
file. 
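By way of illustration only, an initialization file carrying such identifiers might be represented as shown below. The JSON encoding, the field names `frame_order` and `segments`, the key format, and the file-name style locators are all assumptions; the format of the initialization file is not fixed by this description.

```python
import json
from typing import List


def build_init_file(num_frames: int, num_segments: int) -> str:
    """Build a hypothetical initialization file as JSON text."""
    return json.dumps({
        # Order in which the frames (and hence their segments) are played.
        "frame_order": list(range(num_frames)),
        # One identifier per frame segment, mapped to how it can be obtained.
        "segments": {
            f"{f}:{s}": f"frame{f:04d}_seg{s}.bin"
            for f in range(num_frames) for s in range(num_segments)
        },
    })


def locators_for(init_text: str, frame_no: int, wanted: List[int]) -> List[str]:
    """Look up how to obtain the wanted segments of one frame."""
    segments = json.loads(init_text)["segments"]
    return [segments[f"{frame_no}:{s}"] for s in wanted]


init_text = build_init_file(num_frames=2, num_segments=8)
requests = locators_for(init_text, frame_no=1, wanted=[7, 0])
```

The viewer would step through `frame_order` and, for each frame, request only the locators returned for the segments currently in view.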

The foregoing interactive embodiment has the advantage of conserving
bandwidth when sending data from the server to the viewer, since only the
data actually used by the viewer is transmitted. In addition, the processing and 
storage requirements of the viewer are minimized, as only those portions of each 
frame that are needed have to be decoded, decompressed and stored. 

Of course, in some circumstances an interactive approach will not be desired 
or possible. In such cases, the system and process of the present invention can 
still be advantageously employed. For example, referring to FIG. 5, if the
previously-described network connection is employed, but without the ability for the
viewer to request the needed segments from the server, the server transfers the
initialization file followed by all the segments of every frame to the viewer (process
action 502). The viewer next determines which portion of the scene captured in the
panoramic video the person viewing the video wishes to see (process action
504). The viewer then uses the aforementioned identifiers from the initialization file
to selectively extract those segments of each frame in the incoming data stream
that are needed to render the desired view (process action 506). Only the
extracted segments are decoded and decompressed by the viewer (process action
508). Thus, while there is no preservation of network resources, the processing
and storage requirements of the viewer are still minimized.
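The selective extraction of process actions 506 and 508 might be sketched as follows. The incoming stream is modeled, as an assumption, as a sequence of (frame number, segment location, compressed data) records, and zlib again stands in for the actual video codec; the name `extract_needed` is hypothetical.

```python
import zlib
from typing import Iterable, Iterator, Set, Tuple

Record = Tuple[int, int, bytes]  # (frame number, segment location, compressed data)


def extract_needed(stream: Iterable[Record], wanted: Set[int]) -> Iterator[Record]:
    """Decompress only the records whose location is wanted; skip the rest."""
    for frame_no, location, payload in stream:
        if location in wanted:
            yield frame_no, location, zlib.decompress(payload)


# Every segment of every frame arrives, but only two locations are decoded.
incoming = [
    (f, loc, zlib.compress(bytes([loc]) * 4)) for f in range(2) for loc in range(8)
]
decoded = list(extract_needed(incoming, wanted={7, 0}))
```

All sixteen records cross the network in this example, yet only four are ever decompressed, which is the saving in viewer processing described above.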

The panoramic video frame segmentation techniques according to the
present invention also have similar advantages when employed with a direct
connection between the viewer and some type of storage media (e.g., hard drive,
CD, DVD, and the like) where the panoramic video frame segments are stored,
since only those portions of the video frame that are needed are read by the
viewer, stored and processed. For instance, referring to FIG. 6 and using the viewer
described in the aforementioned co-pending application as an example, the viewer
first reads the initialization file from the storage medium (process action 602). In
this case the pointers or identifiers provided in the initialization file identify where on
the storage medium the viewer can obtain each segment of every frame of the
panoramic video being viewed. The viewer next determines which portion of the
scene captured in the panoramic video the person viewing the video wishes to
see (process action 604). It then reads and processes only those segments of the
current panoramic video frame that are needed to provide the desired view to the
user (process action 606). This process is then repeated for each frame of the
panoramic video in the order indicated in the initialization file (process action 608).
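A minimal sketch of such direct-access reading is given below. It assumes that the pointers in the initialization file take the form of byte offsets and sizes, and it uses an in-memory buffer as a stand-in for the storage medium; the names `write_segments` and `read_needed` are hypothetical.

```python
import io
from typing import BinaryIO, Dict, List, Tuple

# (frame number, segment location) -> (byte offset, size) on the medium.
Index = Dict[Tuple[int, int], Tuple[int, int]]


def write_segments(out: BinaryIO, segments: Dict[Tuple[int, int], bytes]) -> Index:
    """Write segments back to back and return where each one was stored."""
    index = {}
    for key, data in segments.items():
        index[key] = (out.tell(), len(data))
        out.write(data)
    return index


def read_needed(src: BinaryIO, index: Index, frame_no: int, wanted: List[int]) -> Dict[int, bytes]:
    """Seek to and read only the wanted segments of one frame."""
    result = {}
    for location in wanted:
        offset, size = index[(frame_no, location)]
        src.seek(offset)
        result[location] = src.read(size)
    return result


medium = io.BytesIO()  # stand-in for a hard drive, CD or DVD
index = write_segments(
    medium, {(f, loc): bytes([f * 8 + loc]) * 4 for f in range(2) for loc in range(8)}
)
view = read_needed(medium, index, frame_no=1, wanted=[7, 0])
```

Only the bytes belonging to the two wanted segments are read; the remaining segments of the frame are never loaded into the viewer's memory.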



While the invention has been described in detail by specific reference to
preferred embodiments thereof, it is understood that variations and modifications
thereof may be made without departing from the true spirit and scope of the
invention. For example, the system and method described above is not limited to
just frames of a panoramic video. Rather, it could be employed advantageously
with the transfer and viewing of any image having a size that exceeds that which
will be viewed. In other words, the system and method according to the present
invention could apply to any image where only a portion of a scene or image will be
viewed on the viewer at any one time.